Sebastian Ruder

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026
Via arXiv

Beg to Differ: Understanding Reasoning-Answer Misalignment Across Languages

Dec 27, 2025
Via arXiv

MENLO: From Preferences to Proficiency -- Evaluating and Modeling Native-like Quality Across 47 Languages

Sep 30, 2025
Via arXiv

Arbiters of Ambivalence: Challenges of Using LLMs in No-Consensus Tasks

May 28, 2025
Via arXiv

The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs

Apr 24, 2025
Via arXiv

A Post-trainer's Guide to Multilingual Training Data: Uncovering Cross-lingual Transfer Dynamics

Apr 23, 2025
Via arXiv

AL-QASIDA: Analyzing LLM Quality and Accuracy Systematically in Dialectal Arabic

Dec 05, 2024
Via arXiv

M-RewardBench: Evaluating Reward Models in Multilingual Settings

Oct 20, 2024
Via arXiv

BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts

Aug 15, 2024
Via arXiv

How Does Quantization Affect Multilingual LLMs?

Jul 03, 2024
Via arXiv